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Abstract 



We consider the concept of a local set of inference rules. A local rule set can be automatically 
transformed into a rule set for which bottom-up evaluation terminates in polynomial time. The 
local-rule-set transformation gives polynomial-time evaluation strategies for a large variety of 
rule sets that cannot be given terminating evaluation strategies by any other known automatic 
technique. This paper discusses three new results. First, it is shown that every polynomial-time 
predicate can be defined by an (unstratified) local rule set. Second, a new machine-recognizable 
subclass of the local rule sets is identified. Finally we show that locaUty, as a property of rule 
sets, is undecidable in general. 

Keywords: Descriptive Complexity Theory, Decision Procedures, Automated Reasoning, 

1. Introduction 

Under what conditions does a given set of inference rules define a computationally 
tractable inference relation? This is a syntactic question about syntactic inference 
rules. There are a variety of motivations for identifying tractable inference rela- 
tions. First, tractable inference relations sometimes provide decision procedures 
for semantic theories. For example, the equational inference rules of reflexivity, 
symmetry, transitivity, and substitutivity define a tractable inference relation that 
yields a decision procedure for the entailment relation between sets of ground 
equations [Kozen, 1977], [Shostak, 1978]. Another example is the set of equa- 
tional Horn clauses valid in lattice theory. As a special case of the results in this 
paper one can show automatically that validity of a lattice-theoretic Horn clause is 
decidable in cubic time. 

Deductive databases provide a second motivation for studying tractable infer- 
ence relations. A deductive database is designed to answer queries using simple 
inference rules as well as a set of declared data base facts. The inference rules in a 
deductive database typically define a tractable inference relation — these inference 
rules are usually of a special form known as a datalog program. A datalog program 
is a set of first-order Horn clauses that do not contain function symbols. Any data- 
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log program defines a tractable inference relation [UUman, 1988], [UUman, 1989]. 
There has been interest in generalizing the inference rules used in deductive data- 
bases beyond the special case of datalog programs. In the general case, where 
function symbols are allowed in Horn clause inference rules, a set of inference 
rules can be viewed as a Prolog program. Considerable work has been done on 
"bottom-up" evaluation strategies for these programs and source-to-source trans- 
formations that make such bottom-up evaluation strategies more efficient [Naugh- 
ton and Ramakrishnan, 1991], [Bry, 1990]. The work presented here on local 
inference relations can be viewed as an extension of these optimization techniques. 
For example, locality testing provides an automatic source-to-source transforma- 
tion on the inference rules for equality (symmetry, reflexivity, transitive, and sub- 
stitution) that allows them to be completely evaluated in a bottom-up fashion in 
cubic time. We do not know of any other automatic transformation on inference 
rules that provides a terminating evaluation strategy for this rule set. 

Tractable rule sets also play an important role in type-inference systems for 
computer programming languages [Talpin and Jouvelot, 1992], [Jouvelot and Gif- 
ford, 1991]. Although we have not yet investigated connections between the notion 
of locality used here and known results on tractability for type inference systems, 
this seems like a fruitful area for future research. From a practical perspective it 
seems possible that general-purpose bottom-up evaluation strategies for inference 
rules can be applied to inference rules for type-inference systems. From a theoreti- 
cal perspective we show below that any polynomial-time predicate can be defined 
by a local set of inference rules and that many type-inference systems give polyno- 
mial-time decidable typabiUty. 

A fourth motivation for the study of tractable inference relations is the role that 
such relations can play in improving the efficiency of search. Many practical 
search algorithms use some form of incomplete inference to prune nodes in the 
search tree [Knuth, 1975], [Mackworth, 1977], [Pearl andKorf, 1987]. Incomplete 
inference also plays an important role in pruning search in constraint logic pro- 
gramming [Jaffar and Lassez, 1987], [van Hentenryck, 1989], [McAUester and 
Siskind, 1991]. Tractable inference relations can also be used to define a notion of 
"obvious inference" which can then be used in "Socratic" proof verification sys- 
tems which require proofs to be reduced to obvious steps [McAUester, 1989], 
[Givanetal., 1991]. 

As mentioned above, inference rules are syntactically similar to first-order 
Horn clauses. In fact, most inference rules can be syntactically represented by a 
Horn clause in sorted first-order logic. If i? is a set of Horn clauses, E is a set of 
ground atomic formulas, and $ is a ground atomic formula, then we write E 4> 
if E u i? I- 4> in first order logic. We write rather than \=j^ because we think 
of as a set of syntactic inference rules and as the inference relation gener- 
ated by those rules. Throughout this paper we use the term "rule set" as a synonym 
for "finite set of Horn clauses". We give nontrivial conditions on R which ensure 
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that the inference relation is polynomial-time decidable. 

As noted above, a rule set R that does not contain any function symbols is 
called a datalog program. It is well-known that the inference relation defined by a 
datalog program is polynomial-time decidable. Vardi and Immerman indepen- 
dently proved, in essence, that datalog programs provide a characterization of the 
complexity class P — any polynomial time predicate on finite databases can be 
written as a datalog program provided that one is given a successor relation that 
defines a total order on the domain elements [Vardi, 1982], [Immerman, 1986], 
[Papadimitriou, 1985]. 

Although datalog programs provide an interesting class of polynomial-time 
inference relations, the class of tractable rule sets is much larger than the class of 
datalog programs. First of all, one can generalize the concept of a datalog program 
to the concept of a superficial rule set. We call a set of Horn clauses superficial if 
any term that appears in the conclusion of a clause also appears in some antecedent 
of that clause. A superficial rule set has the property that forward-chaining infer- 
ence does not introduce new terms. We show in this paper that superficial rule sets 
provide a different characterization of the complexity class P. While datalog pro- 
grams can encode any polynomial-time predicate on finite databases, superficial 
rule sets can encode any polynomial-time predicate on first-order terms. Let Q be 
a predicate on first-order terms constructed from a finite signature. We define the 
DAG size of a first-order term t to be the number of distinct terms that appear as 
subexpressions of t} It is possible to show that if 2 can be computed in polyno- 
mial time in the sum of the DAG size of its arguments then Q can be represented 
by a superficial rule set. More specifically, we prove below that for any such predi- 
cate Q onk first-order terms there exists a superficial rule set R such that Q(ti, 
tj^) if and only if Input(?i, tj^) Accept where Input is a predicate sym- 
bol and Accept is a distinguished proposition symbol. Our characterization of the 
complexity class P in terms of superficial rule sets differs from Immerman 's char- 
acterization of P in terms of datalog programs in two ways. First, the result is 
stated in terms of predicates on terms rather than predicates on databases. Second, 
unlike the datalog characterization, no separate total order on domain elements is 
required. 

Superficial rule sets are a special case of the more general class of local rule 
sets [McAUester, 1993]. A set R of Horn clauses is local if whenever 52 4> there 
exists a proof of 4> from L such that every term in the proof is mentioned in E or <I>. 
If R is local then is polynomial-time decidable. All superficial rule sets are 
local but many local rule sets are not superficial. The set of the four inference rules 
for equality is local but not superficial. The local inference relations provide a third 
characterization of the complexity class P. Let Q he a predicate on first-order 
terms constructed from a finite signature. If Q can be computed in polynomial 



1. The DAG size of a term is the size of the Directed Acyclic Graph representation of the term. 
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time in the sum of the DAG size of its arguments then there exists a local rule set R 
such that for any terms ti, tj^ we have that Qiti, t0 if and only if 
1-^ Q(t^ tj^) where 2 is a predicate symbol representing Q . Note that no super- 
ficial rule set can have this property because forward-chaining inference from a 
superficial rule set can not introduce new terms. We find the characterization of 
polynomial time predicates in terms of local rule sets to be particularly pleasing 
because as just described it yields a direct mapping from semantic predicates to 
predicates used in the inference rules. 

Unlike superficiality, locality can be difficult to recognize. The set of four 
inference rules for equality is local but the proof of this fact is nontrivial. Mechan- 
ically-recognizable subclasses of local rule sets have been identified by McAUester 
[McAUester, 1993] and Basin and Ganzinger [Basin and Ganzinger, 2000]. Here 
we introduce a third mechanically-recognizable subclass which contains a variety 
of natural rule sets not contained in either of these earlier classes. We will briefly 
describe the two earlier classes and give examples of rules sets included in our new 
class that are not included in the earlier classes. 

Basin and Ganzinger identify the class of rule sets that are saturated with 
respect to all orderings compatible with the sub term ordering. The notion of satu- 
ration is derived from ordered resolution. We will refer to these rule sets simply as 
"saturated". Saturation with respect to the class of subterm-compatible orders turns 
out to be a decidable property of rule sets. Membership in the [McAUester, 1993] 
class or the new class identified here is only semi-decidable — a rule set is in these 
classes if there exists a proof of locality of a certain restricted form (a different 
form for each of the two classes). Unfortunately, although saturation is decidable, 
it appears that most natural local rule sets are not saturated. Typically they must be 
expanded (saturated) to include derived rules. This saturation process typically 
yields a less efficient decision procedure for the inference relation defined by the 
rules — the decision procedure must use more inference rules in the larger satu- 
rated set. As an example consider the following rules. 



These rules are local and this rule set is in both McAUester's class and the new 
class introduced here. But they are not saturated. Saturation adds (at least) the fol- 
lowing rules. 



A decision procedure based on the larger saturated set would still run in O(n^) 
time, but the added rules significantly impact the constant factors and this is an 
important issue in practice. 



x<y,y<z^ x<z 
x<y^f(x)<f(y) 



(1) 



x<z,y<f(x)^y<f(z) 
z<x, f(x)<y^f(z)<y 



(2) 



Polynomial-time Computation via Local Inference Rules, July 13, 2000 



4 



The mechanically-recognizable subclass of local rule sets introduced in 
[McAUester, 1993] is called the bounded- local rule sets. This subclass is defined 
carefully in the body of this paper for further comparison to the new subclass intro- 
duced here. The set of the four basic rules for equality is bounded-local. As 
another example of a bounded-local rule set we give the following rules for reason- 
ing about a monotone operator from sets to sets. Let Rjr be the following set of 
inference rules for a monotone operator. 

x<x 

x<y,y<z^ x<z (3) 
x<y^f{x)<f{y) 

There is a simple source-to-source transformation on any local rule set that 
converts the rule set to a superficial rule set without changing the relation 
described. For example, consider the above rules for a monotone operator. We can 
transform these rules so that they can only derive information about terms explic- 
itly mentioned in the query. To do this we introduce another predicate symbol M 
(with the intuitive meaning "mentioned"). Let R'f be the following transformed 
version of Rf. 

M(/(x)) => M(x) 
X < y M(x) 
X < y M(3;) 

(4) 

M(x) X < X 
M(x), M^y), M(z), x<y,y<z^ x<z 
M(/(x)), M(/(3;)), X < y /(x) < f{y) 

Note that R'j is superficial and hence bottom-up (forward-chaining) evaluation 
must terminate in polynomial time^. Then to determine if E ? < m we deter- 
mine, by bottom-up evaluation whether {M(0, M(m)}uL t<u. An analo- 
gous transformation applies to any local rule set. 

A variety of other bounded-local rule sets are given [McAUester, 1993]. As an 
example of a rule set that is local but not bounded local we give the following rules 
for reasoning about a lattice. 



2. For this rule set bottom-up evaluation can be run to completion in cubic time. 
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x<x 
x<y,y <z^ x<z 
x<xvy 
y<xvy 

(5) 

x<z,y<z^xvy<z 

xAy<x 

xAy<y 
Z<x, z<y^Z<xAy 

These rules remain local when the above monotonicity rule is added. With or 
without the monotonicity rule, the rule set is not bounded-local. 

In this paper we construct another machine-recognizable subclass of the local 
rule sets which we call inductively-local rule sets. All of the bounded-local rule 
sets given in [McAUester, 1993] are also inductively-local. The procedure for rec- 
ognizing inductively-local rule sets has been implemented and has been used to 
determine that the above rule set is inductively-local. Hence the inference relation 
defined by the rules in (5) is polynomial-time decidable. Since these rules are com- 
plete for lattices this result implies that validity for lattice-theoretic Horn clauses is 
polynomial-time decidable. 

We been able to show that there are bounded-local rule sets which are not 
inductively-local, although our examples are somewhat artificial. We have not 
found any natural examples of local rule sets that fail to be inductively-local. 
Inductively local rule sets provide a variety of mechanically recognizable polyno- 
mial time inference relations. 

In this paper we also settle an open question from the previous analysis in 
[McAUester, 1993] and show that locality as a general property of rule sets is unde- 
cidable. Hence the optimization of logic programs based on the recognition of 
locality is necessarily a somewhat heuristic process. 

2. BASIC TERMINOLOGY 

In this section we give more precise definitions of the concepts discussed in the 
introduction. 

Definition 1. A Horn clause is a first order formula of the form a ... a ^„ 
$ where $ and the are atomic formulas. For any set of Horn clauses R, 
any finite set E of ground terms, and any ground atomic formula 4>, we write 
Z 4> whenever L u U(R) I- 4> in first-order logic where U(R) is the set of 
universal closures of Horn clauses in R . 
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There are a variety of inference relations defined in tliis paper. For any inference 
relation I- and sets of ground formulas L and F we write L I- F if L I- ^ for 
each ^' in F. 

The inference relation can be given a more direct syntactic characteriza- 
tion. This syntactic characterization is more useful in determining locality. In the 
following definitions and lemma, L is a set of ground atomic formulas, and (j) is a 
single ground atomic formula. 

Definition 2. A derivation of <I> from L using rule set ^ is a sequence of ground 
atomic formulas • • • ' such that ^„ is <I> and for each there exists a 

Horn clause 0^ a ... a 0^ ^ ^' in i? and a ground substitution o such that 
a[^'] is and each formula of the form a[0(] is either a member of L or a for- 
mula appearing in earlier than in the derivation. 

Lemma 1: L <I> if and only if there exists a derivation of <I> from L using the 
rule set R. 

The following restricted inference relation plays an important role in the analysis 
of locality. 

Definition 3. We write E l-^>^ $ if there exists a derivation of ^> from E such 
that every term appearing in the derivation appears as a subexpression of <I> or 
as a subexpression of some formula in E. 

Lemma 2: (Tractability Lemma) [McAUester, 1993] For any finite rule set R 
the inference relation is polynomial-time decidable. 

Proof: Let n be the number of terms that appear as subexpressions of ^> or of a 
formula in E. If 2 is a predicate-symbol of k arguments that appears in the 
inference rules R then there are at most formulas of the form Q(si, Sf.) 
such that E l-^>^ Q(si, Sj^). Since R is finite there is some maximum arity k 
over all the predicate symbols that appear in R. The total number of ground 
atomic formulas that can be derived under the restrictions in the definition of 
is then of order n^. Given a particular set of derived ground atomic formu- 
las, one can determine whether any additional ground atomic formula can be 
derived by checking whether each rule in R has an instance whose antecedents 
are all in the currently derived formulas — for a rule with k' variables, there are 
only instances to check, and each instance can be checked in polynomial 
time. Thus, one can extend the set of derived formulas by checking polynomi- 
ally many instances, each in polynomial time; and the set of derived formulas 
can only be extended at most polynomially many times. The lemma then fol- 
lows. □ 

Clearly, if E 4> then E 1-^ 4> . But the converse does not hold in general. By 
definition, if the converse holds then R is local. 
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Definition 4. [McAUester, 1993]: The rule set R is local if the restricted infer- 
ence relation h»^ is the same as the unrestricted relation . 

Clearly, if R is local then 1-^ is polynomial-time decidable. 

3. CHARACTERIZING P WITH SUPERFICIAL RULES 

In this section we consider predicates on first-order terms that are computable in 
polynomial time. The results stated require a somewhat careful definition of a 
polynomial-time predicate on first-order terms. 

Definition 5. A polynomial-time predicate on terms is a predicate Q on one or 
more first-order terms which can be computed in polynomial time in the sum of 
the DAG sizes of its arguments. 

Definition 6. A rule set is superficial if any term that appears in the conclusion 
of a rule also appears in some antecedent of that rule. 

Theorem 1: (Superficial Rule Set Representation Theorem) If 2 is a polyno- 
mial-time predicate on k first-order terms of a fixed finite signature, then there 
exists a superficial rule set R such that for any first-order terms ti, from 
this signature, we have that Q is true on arguments t-^, t]^ if and only if 
INPUT (?!,..., r^t) ACCEPT. 

As an example consider the "Acyclic" predicate on directed graphs — the predi- 
cate that is true of a directed graph if and only if that graph has no cycles. It is 
well-known that acyclicity is a polynomial-time property of directed graphs. This 
property has a simple definition using superficial rules with one level of stratifica- 
tion — if a graph is not cyclic then it is acyclic. The above theorem implies that the 
acyclicity predicate can be defined by superficial rules without any stratification. 
The unstratified rule set for acyclicity is somewhat complex and rather than give it 
here we give a proof of the above general theorem. The proof is rather technical, 
and casual readers are advised to skip to the next section. 

Proof: (Theorem 1) We only consider predicates of one argument. The proof 
for predicates of higher arity is similar. Let 2 be a one argument polynomial- 
time computable predicate on terms, i.e., a predicate on terms such that one can 
determine in polynomial time in the DAG size of a term t whether or not Q{t) 
holds. Our general approach is to construct a database from t such that the prop- 
erty Q of terms can be viewed as a polynomial-time computable property of the 
database (since the term t can be extracted from the database and then Q{t) 
computed. We can then get a datalog program for computing this property of 
the database, given a total ordering of the database individuals, using the result 
of Immerman and Vardi [Immerman, 1986], [Vardi, 1982]. The proof finishes 
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by showing how superficial rules can be given that construct the required data- 
base from t and the required ordering of the database individuals. The desired 
superficial rule set is then the combination of the datalog program and the 
added rules for constructing the database and the ordering. We now argue this 
approach in more detail. 

We first describe the database that will represent the term t. For each subterm 
5 of ? we introduce a database individual c^, i.e., a new constant symbol unique 
to the term s. We have assumed that the predicate Q is defined on terms con- 
structed from a fixed finite signature, i.e., a fixed finite set of constant and func- 
tion symbols. We will consider constants to be functions of no arguments. For 
each function symbol f of n arguments in this finite signature we introduce a 
database relation Pf of n+l arguments, i.e., Pfis a n+l-ary predicate symbol. 
Now for any term t we define Ef to be the set of ground formulas of the form 
Pj^iCff^si, 4„)' ^s^' ■ ■■' '^s ) where /(5 1,. . .,5'„) is a subterm of t (possibly equal to 
t). The set should be viewed as a database with individuals Cg and relations 
Pf. Let r be a set of formulas of the form S(c^, c„) where s and u are subterms of 
t such that S represents a successor relation on the individuals of E,, i.e., there 
exists a bijection p from the individuals of to consecutive integers such that 
S(s, u) is in F if and only if p(m) = p(s) + 1. The result of Immerman and Vardi 
[Immerman, 1986], [Vardi, 1982] implies that for any polynomial-time property 
of the set E^ u F there exists a datalog program R such that E^ u F has the given 
property if and only if E^ u F ACCEPT. Since the term t can be easily recov- 
ered from the set E^, Q can be viewed as a polynomial-time property of E^, and 
so there must exist a datalog program R such that E^ u F ACCEPT if and 
only if Q{t) . We can assume without loss of generality that no rule in R can 
derive new formulas involving the database predicates P^. If R has such rules 
they can be eliminated by introducing duplicate predicates Py', adding rules that 
copy Py facts to Pf' facts, and then replacing Pfhy Pf' in all the rules. 

We now add to the rule set R superficial rules that construct the formulas needed 
in Ef and F — these rules use a number of "auxiliary" relation symbols in their 
computations; we assume the names of these relation symbols are chosen after 
the choice of R so that there are no occurrences of these relation symbols in R. 
First we define a "mentioned" predicate M such that M(s) is provable if and only 
if 5 is a subterm of t. 



The second rule is a schema for all rules of this form where / is one of the finite 
number of function symbols in the signature and x,- is one of the variables xi, 
...,Xf^. Now we give rules (again via a schema) that construct the formula set E^. 



INPUT(0 ^ M(0 



(6) 



M(/(Xj, PfifiXp ...,xJ,Xi, 



(7) 
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Now we write a collection of rules to construct the formula set F, i.e., rules that 
define a successor relation on the terms in t. We start by defining a simple sub- 
term predicate Su such that Su(m, v) is provable if u and v are subterms of t 
such that M is a subterm of v. The second rule is again a schema for all rules of 
this form within the finite signature. 

M(x) Su(x, x) 
M(/(xi, . . ., Suiy, Xi) => Su(y, f{x^ 

We also need the negation of the subterm predicate, which we will call NI for 
"not in". To define this predicate we first need to define a "not equal" predicate 
NE such that NE(m, v) is provable if and only if u and v are distinct subterms of 
the input t. 

M(/(xi, ...,x^)),n{g{y^, ...,yj)^ 

NE(/(jCi, jc„), g{y^, ...,yj) 
M(/(;Ci, . . ., x^, . . ., x^)), Hf{x^, ...,yi,..., NE(;c., y^) 

NE(/(xj, ...,Xi, fixp ...,yf, ...,x„)) 



(9) 



Instances of the first rule schema must have / and g distinct function symbols 
and in the second rule schema Xf and yi occur at the same argument position and 
all other arguments to /are the same in both terms. Now we can define the "not 
in" predicate NI such that Nl(s, u) if s is not a subterm of u. We only give the 
rules for constants and functions of two arguments. The rules for functions of 
other numbers of arguments are similar. Instances of the first rule schema must 
have c a constant symbol. 

NE(x, c) ^ NI(x, c) 
NE(z, fix, y)), NI(z, x), NI(z, y) NI(z, f{x, y)) 

Now for any subterm s of the input we simultaneously define a three-place 
"walk" relation Vl{s, u, w) and a binary "last" relation L(5, u). W(5, u, w) will be 
provable if s and u are subterms of w and u is the successor of 5 in a left-to-right 
preorder traversal of the subterms of w with elimination of later duplicates. L(.y, 
u) will be provable if s is the last term of the left-to-right preorder traversal of 
the subterms of u, again with elimination of later duplicates. In these defini- 
tions, we also use the auxiliary three-place relation W (s, u, v), where W (s, u, 
f(w, v)) means roughly that s and u are subterms of v such that u comes after s in 
the preorder traversal of v and every term between s and u in this traversal is a 
subterm of w. More precisely, W (s, u, v) is inferred if and only if v has the form 
f{x,y) such that there are occurrences of s and u in the pre-order traversal of y 
(removing duplicates within y) where the occurrence of u is later than the occur- 
rence of s and all terms in between these occurrences in the traversal are sub- 
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temis of X. Using W and NI together (see two different rules below) enables 
the construction of a preorder traversal of y with subterms of x removed that can 
be used to construct a preorder traversal of f{x,y) with duplicates removed. 

^ L(c, c) 

M(/(x, y)), Liylast, y), l^l{ylast, x) ^ L(ylast, f(x, y)) 

M(f(x, y)), Su(y, x), L(xlast, x) Lixlast, f(x, y)) (11) 
L(ylast, y), Suiylast, x), NI(y, x), 
VS'iflast, ylast, f(x, y)), l>il(flast, x) Liflast, f(x, y)) 

M(/(x, y)) W(/(x, y), X, f(x, y)) 
Hf(x, y)), W(m, V, x) => W(m, V, fix, y)) 
M(f(x, y)), Nl(y, x), L(s, x) => Vl(s, y, f(x, y)) (^2) 
M(/(x, y)), W(m, V, y) => W'(m, v, /(x, y)) 
¥l'(u, V, f(x, y)), NI(m, x), NI(v, x) W(m, v, f(x, y)) 
W'(m, V, f(x, y)), W'(v. w. /(^. y)). Su(v, x) W'(m, w, f(x, y)) 

Finally we define the successor predicate S in terms of W, as follows. 

INPUT(z), W(x, y, z) => S(x, y) (13) 

Let /?' be the datalog program R plus all of the above superficial rules. We now 
have that u T ACCEPT if and only if INPUT(0 ACCEPT, and the 
proof is complete. □ (Theorem 1) 

4. CHARACTERIZING P WITH LOCAL RULES 

Using the theorem of the previous section one can provide a somewhat different 
characterization of the complexity class P in terms of local rule sets. 

Definition 7. A rule set R is local if for any set of ground atomic formulas L 
and any single ground atomic formula 4>, we have E 4> if and only if 
E l-^>^ $ . We note that the tractability lemma (Lemma 2) implies immediately 
that If R is local then is polynomial-time decidable. 

Theorem 2: (Local Rule Set Representation Theorem) If (2 is a polynomial- 
time predicate on first-order terms then there exists a local rule set R such that 

for any first-order terms . . ., we have that Q is true on arguments ti, tj^ 
if and only if Qitp ■■■,tj^) where Qisa predicate symbol representing Q . 

Before giving a proof of this theorem we give a simple example of a local rule set 
for a polynomial-time problem. Any context-free language can be recognized in 
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cubic time. This fact is easily proven by giving a translation of grammars into local 
rule sets. We represent a string of words using a constant symbol for each word 
and the binary function CONS to construct terms that represent lists of words. For 
each nonterminal symbol A of the grammar we introduce a predicate symbol P4 of 
two arguments where Pj^(x, y) will indicate that x and y are strings of words and 
that y is the result of removing a prefix of x that parses as category A. For each 
grammar production A — > c where c is a terminal symbol we construct a rule with 
no antecedents and the conclusion P4(C0NS(c, x), x). For each grammar produc- 
tion A — > fi C we have the following inference rule: 



Finally, we let P be a monadic predicate which is true of strings generated by the 
distinguished start nonterminal S of the grammar and add the following rule: 



Let R be this set of inference rules, i? is a local rule set, although the proof of local- 
ity is not entirely trivial. The rule set R also has the property that P(x) if and 
only if X is a string in the language generated by the given grammar. General meth- 
ods for analyzing the order of running time of local rule sets can be used to imme- 
diately give that these clauses can be run to completion in order rr' time where n is 
the length of the input string.^ We have implemented a compiler for converting 
local rule sets to efficient inference procedures. This compiler can be used to auto- 
matically generate a polynomial-time parser from the above inference rules. 

Proof: (Theorem 2) We now prove the above theorem for local inference rela- 
tions from the preceding theorem for superficial rule sets. By the superficial 
rule-set representation theorem there must exist a superficial rule set R such that 
for any first order terms ti, tj^ we have that Q(t^, if and only if 

INPUT(?i, tj^ ACCEPT where INPUT is a predicate symbol and 
ACCEPT is a distinguished proposition symbol. For each predicate symbol S of 
m arguments appearing in i? let S' be a new predicate symbol of k+m argu- 
ments. We define the rule set i?' to be the rule set containing the following 
clauses. 



3. An analysis of the order of running time for decision procedures for local inference relations is 
given in [McAUester, 1993]. 



Pg{x, y) A P^iy, z) =^ Pa(x, z) . 



(14) 



Pgix, NIL) ^P(x). 



(15) 
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Input '(xj, ...,x,^,Xp ...,Xi) 
Accept '(xj, . . ., x^) Q(xp . . ., x^) 

S^(x^, .... X^, J, .... ^i^OTj) A ... 

...A S„(Xj, X^, t^^ J, .... ?„^;„^) S (Xp Xf^, S-^, Sj) 

where S^it^^ i, ?i „,^) a ... a 5„a„ j ^n,m„) => "^(^i is in 

Given the above definition we can easily show that S'(tp . . ., 5^, . . ., 5^) if 
and only if Input(?i, tj^) S{s-^, 5^). Therefore, it follows that 
Input(?i, tj^) Accept if and only if Q(ti, tj^). It remains only to 
show that R' is local. Suppose that Z l-^^^. We must show that E l-^>^4>. Let ti, 
tj^he the first k arguments in If $ is Q(ti, tj^ then either ^> is in E (in 
which case the result is trivial), or we must also have Z Accept 
so that it suffices to prove the result assuming that 4> is the application of the 
primed version of a predicate appearing in R. Every derivation based on R' 
involves formulas which all have the same first k arguments — in particular, 
given that Z ^> we must have that E' 4> where Z' is the set of formulas in 
L that have t^, tj^SiS their first k arguments. Let E" and be the result of 
replacing each formula tj^, s^, by S{s]^, s^) in L' and 

respectively. Since E' 4> we must have Input(?i, tj^) u E" But 
since R is superficial every term in the derivation underlying Input(?i, 
u E" 1-^ either appears in some or appears in E". This implies that every 
term in the derivation appears in either E' or 4>, and thus that E H>>^^. □ 
(Theorem 2) 



5. ANOTHER CHARACTERIZATION OF LOCALITY 

In this section we give an alternate characterization of locality. This characteriza- 
tion of locality plays an important role in both the definition of bounded-local rule 
sets given in [McAUester, 1993] and in the notion of inductively-local rule sets 
given in the next section. 

Definition 8. A bounding set is a set Y of ground terms such that every subterm 
of a member of Y is also a member of Y. 

Definition 9. A ground atomic formula ^ is called a label formula of a bound- 
ing set Y if every term in ^ is a member of Y. 

Definition 10. For any bounding set Y, we define the inference relation l-^>^ y 
to be such that E \-^>jiy^ if and only if there exists a derivation of 4> from E 
such that every formula in the derivation is a label formula of the term set Y. 
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We have that E l-^>^ $ if and only if E l-^>^ ^ $ where Y is the set of all terms 
appearing as subexpressions of 4> or of formulas in E. The inference relation l-^>^ y 
can be used to give another characterization of locality. Suppose that R is not local. 
In this case there must exist some E and ^> such that E b'»^ 4> but E <I> . Let Y be 
the set of terms that appear in E and We must have E |^>^ y However, since 
E 1-^ 4> we must have E for some finite superset Y' of Y Consider 

"growing" the bounding set one term at a time, starting with the terms that appear 
in E and 

Definition 11. A one-step extension of a bounding set Y is a ground term a that 
is not in Y but such that every proper subterm of a is a member of Y. 

Definition 12. A feedback event for R consists of a finite set E of ground for- 
mulas, a ground formula 4>, a bounding set Y containing all terms that appear in 
E and 4>, and a one-step extension a of Y such that ^l^>^yu{a}^' 
El^>^j,4). 

By abuse of notation, a feedback event will be written as E h»jj Y<^{a} ^ • 

Lemma 3: [McAllester, 1993]: R is local if and only if there are no feedback 
events for R. 

Proof: First note that if R has a feedback event then R is not local — if 
E y^ 4> then E 4> but if E l^>^ y 4> then E <I>. Conversely sup- 
pose that R is not local. In that case there is some E and <3> such that E \h>j^ ^ 
but E l-^>^ y $ for some finite Y By considering a least such Y one can show 
that a feedback event exists for R. □ 

The concepts of bounded locality and inductive locality both involve the con- 
cept of a feedback event. We can define bounded locality by first defining Cji(L, Y) 
to be the set of formulas ^ such that E y ^ . is bounded-local if it is local 
and there exists a natural number k such that whenever E l-^>^ y^ ^ there 
exists a /:-step or shorter derivation of ^ from Cr(L, Y) such that every term in the 
derivation is a member of Y u {a}. As mentioned above, the set of the four basic 
inference rules for equahty is bounded-local and there exists a procedure which 
can recognize the locality of any bounded-local rule set [McAllester, 1993]. The 
definition of inductive locality is somewhat more involved and is given in the next 
section. 

6. INDUCTIVE LOCALITY 

To define inductive locality we first define the notion of a feedback template. A 
feedback template represents a set of potential feedback events. We also define a 
backward chaining process which generates feedback templates from a rule set R. 
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We show that if there exists a feedback event for R then such an event will be 
found by this backchaining process. Furthermore, we define an "inductive" termi- 
nation condition on the backchaining process and show that if the backchaining 
process achieves inductive termination then R is local. 

Throughout this section we let i? be a fixed but arbitrary set of Horn clauses. 
The inference relation y will be written as with the understanding that R 
is an implicit parameter of the relation. 

We define feedback templates as ground objects — they contain only ground 
first-order terms and formulas. The process for generating feedback templates is 
defined as a ground process — it only deals with ground instances of clauses in R. 
The ground process can be "lifted" using a lifting transformation. Since lifting is 
largely mechanical for arbitrary ground procedures [McAUester and Siskind, 
1991], the lifting operation is only discussed very briefly here. 

Definition 13. A feedback template consists of a set of ground atomic formulas 
E, a multiset of ground atomic formulas T, a ground atomic formula 4>, a 
bounding set Y, and a one-step extension a of Y such that ^> and every formula 
in Z is a label formula of Y, every formula in F is a label formula of Y u {a} 
that contains a, and such that E u F l-^>yi^ ^^^^^ ^> . 

By abuse of notation a feedback template will be written as E, F \-^>Yu{a} ^■ 
F is a multiset of ground atomic formulas, each of which is a label formula of Y u 
{a} containing a, and such that the union of E and F allow the derivation of 4> rel- 
ative to the bounding set Y u {a}. A feedback template is a potential feedback 
event in the sense that an extension of E that allows a derivation of the formulas in 
F may result in a feedback event. The requirement that F be a multiset is needed 
for the template-based induction lemma given below. Feedback templates for R 
can be constructed by backward chaining. 

Procedure for Generating a Template for R: 

1. Let ^1 A ... A ^„ ^ $ be a ground instance of a clause in R. 

2. Let a be a term that appears in the clause but does not appear in the con- 
clusion 4> and does not appear as a proper subterm of any other term in 
the clause. 

3. Let Y be a bounding set that does not contain a but does contain every 
term in the clause other than a. 

4. Let E be the set of antecedents which do not contain a. 

5. Let F be the set of antecedents which do contain a. 

6. Return the feedback template E, F l-^>yi^{oc} ^• 
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We let Tq[R] be the set of all feedback templates that can be derived from R by an 
application of the above procedure. We leave it to the reader to verify that Tq[R] is 
a set of feedback templates. Now consider a feedback template L, T \-»YKj{a} ^■ 
We can construct a new template by backward chaining from L, F \-^>Y(j{a} ^ 
using the following procedure. 

Procedure for Backchaining from L, F l-»yu{a} ^ 

1 . Let be a member of F 

2. Let ^1 A ... A ^„ be a ground instance of a clause in R that has as 
its conclusion and such that each is a label formula of Y u {a}. 

3. Let L' be L plus all antecedents that do not contain a. 

4. Let F' be F minus plus all antecedents that contain a. 

5. Return the template Z', F' l-^>y;^{oc} ^• 

In step 4 of the above procedure, F' is constructed using multiset operations. For 
example, if the multiset F contains two occurrences of 0, then "F minus 0" con- 
tains one occurrence of 0. We need F to be a multiset in order to guarantee that 
certain backchaining operations commute in the proof of the induction lemma 
below — in particular, we will use the fact that if a sequence of backchaining oper- 
ations remove an element of F at some point, then there exists a permutation of 
that sequence of backchaining operations producing the same resulting template, 
but that removes first. 

For any set T of feedback templates we define B[T] to he T plus all templates 
that can be derived from an element of Thy an application of the above backchain- 
ing procedure. It is important to keep in mind that by definition B[T] contains T. 
We let 5"[r] be B[B[- ■ -BIT]]] with n applications of B. 

Definition 14. A feedback template is called critical if F is empty. 

If L, \-^>YKj{a} ^ is a critical template then L '^>yu{a} ^- If ^ ^ then 
E \-^>Y(j{a} ^ is a feedback event. By abuse of notation, a critical template 
E, \-»Yu {a} ^ ^^ch that Z l7^>y 4> will itself be called a feedback event. The fol- 
lowing lemma provides the motivation for the definition of a feedback template 
and the backchaining process. 

Lemma 4: There exists a feedback event for R if and only if there exists a j 
such that BJ[Tq[R]] contains a feedback event. 

Proof: To prove the Lemma 4 suppose that there exists a feedback event for R. 
Let E l^>yu {a} ^ ^ minimal feedback event for R, i.e., a feedback event for 
R which minimizes the length of the derivation of ^> from E under the bounding 
set Y u {a}. The fact that this feedback event is minimal implies that every for- 
mula in the derivation other than <I> contains a. To see this suppose that is a 
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formula in the derivation other than $ that does not involve a. We then have 
L h>>y^ {oj} and E u {0} H>>yi^ {oc} ^- these two must be a feedback 

event — otherwise we would have L 4>. But if one of these is a feedback 
event then it involves a smaller derivation than L '^>yu{a} ^ contra- 
dicts the assumption that E l-^>y^j{oc} ^ minimal. Since every formula other 
than 4> in the derivation underlying Z \-»Yu{a} ^ contains a, the template 
L, H>>7u{oc} ^ can be derived by backchaining steps mirroring that deriva- 
tion. □ 

The above lemma implies that if the rule set is not local then backchaining will 
uncover a feedback event. However, we are primarily interested in those cases 
where the rule set is local. If the backchaining process is to establish locality then 
we must find a termination condition which guarantees locality. Let T be a set of 
feedback templates. In practice T can be taken to be B^[Tq[R]] for some finite j. We 
define a "self-justification" property for sets of feedback templates and prove that 
if ris self -justifying then there is no n such that B"[T] contain a feedback event. In 
defining the self -justification property we treat each template in T as an indepen- 
dent induction hypothesis. If each template can be "justified" using the set of tem- 
plates as induction hypotheses, then the set T is self -justifying. 

Definition 15. We write L, F h»7^ y <I> if 7 contains templates 

^2' ^2 '^>yu{a} ^^^^ 

where each L^- is a subset of E, each r^- is a subset of T and E u {^j, 

Definition 16. A set of templates T is said to justify a template 
E,r K> 

Yu{a} ^ there exists a e F such that for each template E', F' 
\-»Yu {a} ^ generated by one step of backchaining from E, F \-^>Y^{(y,} ^ 
selecting at step 1 of the backchaining procedure we have E', F' h»7.y ^. 

Definition 17. The set T is called self-justifying if every member of T is either 
critical or justified by T, and Tdoes not contain any feedback events. 

Theorem 3: (Template-based Induction Theorem) If T is self -justifying then 
no set of the form B'\T] contains a feedback event. 

Proof: Consider a self -justifying set T of templates. We must show that for 
every critical template E, l-^>yi^{oc} ^ B'^[T] we have that E h»y 4>. The 
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proof is by induction on n. Consider a critical template E, l-^>yu{a} ^ 
B\T\ and assume the theorem for all critical templates in B\T for j less than n. 
The critical template E, l-»y^{oc} ^ must be derived by backchaining from 
some template E', F' h»y^ {^j} ^ in T. Note that E' must be a subset of E. If F' 
is empty then E' equals E and E l-^>y 4> because T is self-justifying and thus 
cannot contain any feedback events. If F' is not empty then, since Tis self -justi- 
fying, we can choose a in F' such that for each template E", F" \-^>Y^{a} ^ 
derived from E', F' l-^>yu {q^} ^> by a single step of backchaining on we have 
E", F" \-»jY ^- We noted above that backchaining operations commute (to 
ensure this we took F to be a multiset rather than a set). By the commutativity 
of backchaining steps there exists a backchaining sequence from 
E', F' l-^>yi^{oc} ^ to E, l-^>y^{oc} ^ such that the first step in that sequence 
is a backchaining step on 0. Let E*, F* h»yi^ {g^} ^ be the template that results 
from this first backchaining step from E', F' ^^^^ Note that E* is a sub- 

set of E. We must now have E*, F* h»j y 4>. By definition, Tmust contain tem- 
plates 

E^, Fj H>>yu{oc} ^1 

Ej, F2 l-^>yu{a} ^2 ^^g^ 

'^k' l^^yuia} ^k 

such that each E,- is a subset of E*, each F,- is a subset of F*, and E* u {, 

h»y 4>. Note that each E^- is a subset of E. Since F, is a subset of F* 
there must be a sequence of fewer than n backchaining steps that leads from 
E-, F- H>>yi^{oc} to a critical template E'^-, H>>yi^{(x} such that E'^- is a 
subset of E. This critical template is a member of B^{T] for j less than n and so 
by our induction hypothesis this template cannot be a feedback event; as a con- 
sequence we have E'^ l-»y and thus E l-^>y m^. But if E l-^>y for each m^, 
and Eu {^1, ^2. ^A;} '^>y then E K>y 4>. □ (Template-based Induc- 
tion Theorem) 

The following corollary then follows from Theorem 3 along with Lemmas 3 and 4: 

Corollary 1: If B\Tq{KW is self-justifying, for some n, then R is local. 
We now come the main definition and theorem of this section. 

Definition 18. A rule set R is called inductively-local if there exists some n 
such that B"[ro[i?]] is self-justifying. 

Theorem 4: There exists a procedure which, given any finite set R of Horn 
clauses, will terminate with a feedback event whenever R is not local, terminate 
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with "success" whenever R is inductively-local, and fail to teraiinate in cases 
where R is local but not inductively-local. 

The procedure is derived by lifting the above ground procedure for computing 
B'^[Tq[R]]. Lifting can be formalized as a mechanical operation on arbitrary nonde- 
terministic ground procedures [McAUester and Siskind, 1991]. In the lifted version 
the infinite set BJ[Tq[R]] is represented by a finite set of "template schemas" each 
of which consists of a template expression L, F \-^>Y(j{a} ^ involving variables 
plus a set of constraints on those variables. We have implemented the resulting 
lifted procedure and used it to verify the locality of a variety of rule sets, including 
the rule set given as equation (5) above for reasoning about lattices. This procedure 
is also useful for designing local rule sets — when applied to a nonlocal rule set 
the procedure returns a feedback event that can often be used to design additional 
rules that can be added to the rule set to give a local rule set computing the same 
inference relation. 

7. LOCALITY IS UNDECIDABLE 

We prove that locality is undecidable by reducing the Halting problem. Let M be a 
specification of a Turing machine. We first show one can mechanically construct a 
local rule set R with the property that the machine M halts if and only if there exists 
a term t such that H(t) where His a monadic predicate symbol. Turing machine 
computations can be represented by first-order terms and the formula H(t) intu- 
itively states that ? is a term representing a halting computation of M. 

To prove this preliminary result we first construct a superficial rule set S such 
that M halts if and only if there exists a term t such that INPUT(0 H(t). The 
mechanical construction of the superficial rule set S from the Turing machine M is 
fairly straightforward and is not given here. We convert this superficial rule set S to 
a local rule set R as follows. For each predicate symbol Q of m arguments appear- 
ing in S let Q' be a new predicate symbol of m+l arguments. The rule set R will be 
constructed so that Q(t, Si, 5^) if and only if INPUT(0 Q(t, 5^, s^). 
We define the rule set R to be the rule set containing the following clauses: 

Input '(x, x) , H\x, x) H{x) , and each clause of the form 

^1, 1' ^i,wi) ••• (19) 

•••A 2n(^' Vl' •••'^n,m„)=>^'(^'*l' •••'*;) 

where Q^{t^ p...,t^ ,^^)A...AQJ,t^ p...,t^^^)^W{sp...,sp is in R. By 
the design of R we can easily show that Q'{t, s^, 5^) if and only if INPUT(?) 

Q(t, Si, 5^), and so it directly follows that INPUT(0 1^ H(t) if and only if 
h, H(t) . So the Turing machine M halts if and only if K H(t) for some term t, as 
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desired. The proof that the rule set R is local closely follows the proof that R' is 
local in the Local Rule Set Representation Theorem proven above (Theorem 2). 

We have now constructed a local rule set R with the property that M halts if and 
only if there exists some term t such that H(t) . Now let R' he R plus the single 
clause H{x) HALTS where HALTS is a new proposition symbol. We claim that 
R' is local if and only if M does not halt. First note that if M halts then we have 
both HALTS and l7'^»^ HALTS so R is not local. Conversely, suppose that M 
does not halt. In this case we must show that i?' is local. Suppose that L 1-^^. We 
must show that Z Suppose 4> is some formula other than HALTS. In this 

case L l-^4> is equivalent to E ^ . Since R is local we must have E 4> and 
thus E Now suppose 4> is the formula HALTS. If HALTS is a member of E 

then the result is trivial so we assume that HALTS is not in E. Since E HALTS 
we must have E \-j^H(c) for some term c. This implies that E H(c) and thus 
E h»^ H(c) and E H(c). To show E h»^HALTS it now suffices to show that c 
is mentioned in E. By the preceding argument we have E H(c) . Since the rule 
set R was generated by the construction given above, we have that every inference 
based on a clause in R is such that every formula in the inference has the same first 
argument. This implies that E' h»^ H(c) where E' is the set of all formulas in E 
that have c as a first argument. We have assumed that M does not halt, and thus 

H(c). Hence E' must not be empty. Since every formula in E' mentions c, and 
E' is contained in E, we can conclude that E must mention c — thus since 
E h»^ H(c) we have E K>^HALTS . 

8. OPEN PROBLEMS 

In closing we note some open problems. There are many known examples of rule 
sets which are not local and yet the corresponding inference relation is polyno- 
mial-time decidable. In all such cases we have studied there exists a conservative 
extension of the rule set which is local. We conjecture that for every rule set R such 
that is polynomial-time decidable there exists a local conservative extension of 
R. Our other problems are less precise. Can one find a "natural" rule set that is 
local but not inductively local? A related question is whether there are useful 
machine recognizable subclasses of the local rule sets other than the classes of 
bounded-local and inductively-local rule sets? 
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